Overview

Dataset Statistics

Number of Variables 12
Number of Rows 627
Missing Cells 555
Missing Cells (%) 7.4%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 276.8 KB
Average Row Size in Memory 452.1 B
Variable Types
  • Categorical: 6
  • Numerical: 6

Dataset Insights

diasemchuva has 185 (29.51%) missing values Missing
precipitacao has 185 (29.51%) missing values Missing
riscofogo has 185 (29.51%) missing values Missing
diasemchuva is skewed Skewed
precipitacao is skewed Skewed
riscofogo is skewed Skewed
latitude is skewed Skewed
frp is skewed Skewed
municipio has a high cardinality: 227 distinct values High Cardinality
satelite has constant value "AQUA_M-T" Constant
pais has constant value "Brasil" Constant
datahora has constant length 19 Constant Length
satelite has constant length 8 Constant Length
pais has constant length 6 Constant Length
latitude has 625 (99.68%) negatives Negatives
longitude has 627 (100.0%) negatives Negatives
precipitacao has 402 (64.11%) zeros Zeros
  • 1
  • 2

Variables

datahora

categorical

Approximate Distinct Count 8
Approximate Unique (%) 1.3%
Missing 0
Missing (%) 0.0%
Memory Size 51.4 KB
  • The largest value (2021/06/26 17:00:00) is over 4.63 times larger than the second largest value (2021/06/27 17:45:00)

Length

Mean 19
Standard Deviation 0
Median 19
Minimum 19
Maximum 19

Sample

1st row 2021/06/26 17:00:0...
2nd row 2021/06/26 17:00:0...
3rd row 2021/06/26 17:00:0...
4th row 2021/06/26 17:00:0...
5th row 2021/06/26 17:00:0...

Letter

Count 0
Lowercase Letter 0
Space Separator 627
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 8778
  • The top 2 categories (2021/06/26 17:00:00, 2021/06/27 17:45:00) take over 50.0%
  • datahora has words of constant length

satelite

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 44.7 KB

Length

Mean 8
Standard Deviation 0
Median 8
Minimum 8
Maximum 8

Sample

1st row AQUA_M-T
2nd row AQUA_M-T
3rd row AQUA_M-T
4th row AQUA_M-T
5th row AQUA_M-T

Letter

Count 3762
Lowercase Letter 0
Space Separator 0
Uppercase Letter 3762
Dash Punctuation 627
Decimal Number 0
  • satelite has words of constant length

pais

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 43.5 KB

Length

Mean 6
Standard Deviation 0
Median 6
Minimum 6
Maximum 6

Sample

1st row Brasil
2nd row Brasil
3rd row Brasil
4th row Brasil
5th row Brasil

Letter

Count 3762
Lowercase Letter 3135
Space Separator 0
Uppercase Letter 627
Dash Punctuation 0
Decimal Number 0
  • pais has words of constant length

estado

categorical

Approximate Distinct Count 21
Approximate Unique (%) 3.3%
Missing 0
Missing (%) 0.0%
Memory Size 45.5 KB
  • The largest value (MATO GROSSO) is over 1.94 times larger than the second largest value (TOCANTINS)

Length

Mean 9.2951
Standard Deviation 2.8955
Median 9
Minimum 4
Maximum 19

Sample

1st row RIO DE JANEIRO
2nd row SAO PAULO
3rd row MATO GROSSO DO SUL
4th row SAO PAULO
5th row RIO DE JANEIRO

Letter

Count 5468
Lowercase Letter 0
Space Separator 360
Uppercase Letter 5468
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (MATO GROSSO, TOCANTINS) take over 50.0%

municipio

categorical

Approximate Distinct Count 227
Approximate Unique (%) 36.2%
Missing 0
Missing (%) 0.0%
Memory Size 47.0 KB

Length

Mean 11.7081
Standard Deviation 5.0593
Median 11
Minimum 4
Maximum 27

Sample

1st row SANTO ANTONIO DE P...
2nd row IBITINGA
3rd row ANTONIO JOAO
4th row SANTA BARBARA D'OE...
5th row RIO DE JANEIRO

Letter

Count 6829
Lowercase Letter 0
Space Separator 507
Uppercase Letter 6829
Dash Punctuation 2
Decimal Number 0
  • The largest value (do) is over 3.18 times larger than the second largest value (araguaia)

bioma

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Memory Size 44.6 KB
  • The largest value (Cerrado) is over 1.82 times larger than the second largest value (Amazonia)

Length

Mean 7.8724
Standard Deviation 1.7694
Median 7
Minimum 7
Maximum 14

Sample

1st row Mata Atlantica
2nd row Cerrado
3rd row Cerrado
4th row Mata Atlantica
5th row Mata Atlantica

Letter

Count 4891
Lowercase Letter 4219
Space Separator 45
Uppercase Letter 672
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Cerrado, Amazonia) take over 50.0%
  • The largest value (cerrado) is over 1.82 times larger than the second largest value (amazonia)

diasemchuva

numerical

Approximate Distinct Count 54
Approximate Unique (%) 12.2%
Missing 185
Missing (%) 29.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 6.9 KB
Mean 21.8959
Minimum -999
Maximum 59
Zeros 8
Zeros (%) 1.3%
Negatives 1
Negatives (%) 0.2%
  • diasemchuva is skewed left (γ1 = -18.7443)

Quantile Statistics

Minimum -999
5-th Percentile 2
Q1 14
Median 24
Q3 37
95-th Percentile 42
Maximum 59
Range 1058
IQR 23

Descriptive Statistics

Mean 21.8959
Standard Deviation 50.5023
Variance 2550.4789
Sum 9678
Skewness -18.7443
Kurtosis 376.5236
Coefficient of Variation 2.3065
  • diasemchuva is not normally distributed (p-value 5.836571674091865e-16)
  • diasemchuva has 1 outliers

precipitacao

numerical

Approximate Distinct Count 13
Approximate Unique (%) 2.9%
Missing 185
Missing (%) 29.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 6.9 KB
Mean 0.06109
Minimum 0
Maximum 3.2
Zeros 402
Zeros (%) 64.1%
Negatives 0
Negatives (%) 0.0%
  • precipitacao is skewed right (γ1 = 7.3976)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0.2
Maximum 3.2
Range 3.2
IQR 0

Descriptive Statistics

Mean 0.06109
Standard Deviation 0.3513
Variance 0.1234
Sum 27
Skewness 7.3976
Kurtosis 55.5204
Coefficient of Variation 5.7502
  • precipitacao is not normally distributed (p-value 4.636568971665533e-25)
  • precipitacao has 40 outliers

riscofogo

numerical

Approximate Distinct Count 12
Approximate Unique (%) 2.7%
Missing 185
Missing (%) 29.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 6.9 KB
Mean -10.4324
Minimum -999
Maximum 1
Zeros 5
Zeros (%) 0.8%
Negatives 5
Negatives (%) 0.8%
  • riscofogo is skewed left (γ1 = -9.2418)

Quantile Statistics

Minimum -999
5-th Percentile 0.3
Q1 0.8
Median 1
Q3 1
95-th Percentile 1
Maximum 1
Range 1000
IQR 0.2

Descriptive Statistics

Mean -10.4324
Standard Deviation 105.8628
Variance 11206.9344
Sum -4611.1
Skewness -9.2418
Kurtosis 83.4107
Coefficient of Variation -10.1475
  • riscofogo is not normally distributed (p-value 4.257058093159736e-25)
  • riscofogo has 34 outliers

latitude

numerical

Approximate Distinct Count 604
Approximate Unique (%) 96.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9.8 KB
Mean -12.3042
Minimum -25.865
Maximum 3.875
Zeros 0
Zeros (%) 0.0%
Negatives 625
Negatives (%) 99.7%
  • latitude is skewed left (γ1 = -0.2988)

Quantile Statistics

Minimum -25.865
5-th Percentile -20.9886
Q1 -14.6955
Median -11.747
Q3 -9.6325
95-th Percentile -4.8666
Maximum 3.875
Range 29.74
IQR 5.063

Descriptive Statistics

Mean -12.3042
Standard Deviation 4.8643
Variance 23.6613
Sum -7714.761
Skewness -0.2988
Kurtosis 0.1036
Coefficient of Variation -0.3953
  • latitude is not normally distributed (p-value 1.2490297092640167e-07)
  • latitude has 28 outliers

longitude

numerical

Approximate Distinct Count 614
Approximate Unique (%) 97.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9.8 KB
Mean -50.2646
Minimum -72.316
Maximum -35.586
Zeros 0
Zeros (%) 0.0%
Negatives 627
Negatives (%) 100.0%
  • longitude is skewed left (γ1 = -0.561)

Quantile Statistics

Minimum -72.316
5-th Percentile -59.5544
Q1 -53.8465
Median -49.718
Q3 -46.9365
95-th Percentile -42.6473
Maximum -35.586
Range 36.73
IQR 6.91

Descriptive Statistics

Mean -50.2646
Standard Deviation 5.4547
Variance 29.7541
Sum -31515.931
Skewness -0.561
Kurtosis 1.0623
Coefficient of Variation -0.1085
  • longitude is not normally distributed (p-value 0.00028657980267450815)
  • longitude has 13 outliers

frp

numerical

Approximate Distinct Count 423
Approximate Unique (%) 67.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9.8 KB
Mean 74.7576
Minimum 4.6
Maximum 5013.6
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • frp is skewed right (γ1 = 12.8612)

Quantile Statistics

Minimum 4.6
5-th Percentile 7.43
Q1 14.45
Median 24.2
Q3 54.8
95-th Percentile 213
Maximum 5013.6
Range 5009
IQR 40.35

Descriptive Statistics

Mean 74.7576
Standard Deviation 279.6282
Variance 78191.937
Sum 46873
Skewness 12.8612
Kurtosis 197.064
Coefficient of Variation 3.7405
  • frp is not normally distributed (p-value 5.35654515877293e-25)
  • frp has 65 outliers

Interactions

Correlations

Missing Values